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1.  Introduction 


Let  us  consider  two  well-known  stationary  processes  with  memory  [y  ,  X,  R]  and 

o 

[y„,  Y,  R] ;  where  y  and  y  the  corresponding  measures,  X  and  Y  the  respective  names 
1  o  1 

of  the  processes,  and  R  the  real  line  on  which  both  processes  take  values.  Let  both 

processes  be  discrete-time  and  let  w^;  j  >  i  denote  an  observed  data  sequence 

Let  the  observations  start  at  time  zero,  and  let  it  be  known  that  the 

Initially  active  process  is  [y  ,  X,  R] .  Let  it  be  possible  that  at  some  point  in 

o 

time,  t,  the  process  [y  ,  X,  R]  may  become  inactive,  and  that  the  process  [y  ,  Y,  R] 

o  1 

may  be  active.  Instead,  and  remain  so.  Let  the  two  processes  be  mutually  independent, 

and  let  the  time  t  above  be  allowed  to  take  any  nonnegative  integer  values  0,1 . . 

that  is,  the  possibility  that  the  process  [y^,  Y,  R]  becomes  active  at  time  zero  is 

Included.  We  consider  the  problem  of  formulating  a  test,  that  detects  the  [y  ,  X,  R] 

o 

to  [y^,  Y,  R]  change  (if  and  whenever  such  change  may  occur),  accurately,  and  fast. 

The  problem  of  detecting  the  possible  change  from  a  given  process  to  another 

given  process,  has  numerous  applications,  ranging  from  industrial  quality  control,  to 

edge  detection  in  linages,  to  the  diagnosis  of  faults  in  the  elements  of  computer- 

communication  networks.  The  case  where  both  the  processes  [y  ,  X,  R]  and  [y  ,  Y,  R] 

o  1 

are  memoryless  has  been  fully  analyzed.  Zacks  and  Kander  [13]  proposed  an  ad  hoc 
procedure  where  fixed  size  n  data  blocks  are  collected,  and  based  on  those  data  a 
decision  as  to  change  is  attempted.  However,  Page  [7],  [8]  was  the  first  to  propose 
a  sequential,  and  one-step  memory  algorithm,  for  this  case.  Lorden  [6]  proved  that 
Page's  algorithm  is  also  asymptotically  optimal,  in  a  sense  that  will  be  explained 
later.  In  [9],  [10]  Page's  algorithm  has  been  successfully  applied  in  the  effective 
Identification  of  faults  in  the  links  of  Computer-Communication  networks. 

Although  the  problem  of  detecting  a  change  in  the  active  process  has  been  fully 
analyzed  in  the  memory less  case,  no  consideration  has  been  given  to  the  case  where 
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the  two  processes  [yQ,  X,  R]  and  [y^,  Y,  R]  are  stationary  with  memory.  In  this 
paper,  we  undertake  this  task.  We  will  first  present  insight  to  our  formalization. 
Then,  we  will  propose  a  sequential  test.  Finally,  we. will  prove  the  asymptotic 
optimality  of  our  test,  in  a  sense  that  will  be  explained. 

3.  Preliminaries 

Let  R  be  the  real  line,  and  let  8  be  the  Borel  a-field  generated  by  open  intervals 

on  R.  Then,  (R,  8)  is  a  measurable  space.  Let  Rn  and  8°  denote  n-tuples  of  R  and  8 

respectively.  Let  yn  be  a  probability  measure  defined  on  (Rn,  if1).  Then,  yn  has  a 
00  00  00  00  00 

unique  extension  y  on  (R  ,  8  ) .  Let  y^  and  y^  be  two  stationary  and  ergodic  proba- 

OO  00 

bility  measures  defined  on  (R  ,  8  ) .  Let  the  Kullback-Leibler  information  I  .  (n) 

ylyo 

exist  for  all  n;  that  is,  let  (1)  below  be  true.  Then,  y^  is  aboslutely  continuous 
with  respect  to  y”,  for  all  n. 


.  r  d  y" 
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00  00 

Given  probability  measures  y  and  y,  as  above,  let  it  be  known  that  the  measure 

o  1 

oo 

11  is  initially  active.  Let  it  be  possible  that  at  some  time  instant  m:  m  >_  0,  the 
o 

00  00 

measure  y  may  become  inactive,  and  the  measure  y  may  become  active  instead.  Thus, 
o  1 

given  an  observation  sequence  w”;  n  >_  0,  it  is  possible  that  an  initial  portion  w™ 

oo  n 

of  the  sequence  has  been  generated  by  the  measure  y  ,  and  the  remaining  portion  w 

o  nrrl 

00 

has  been  generated  by  the  measure  y^;  where  -1  <  m  <  n;  if  m  =  -1,  the  total  sequence 
n  09 

w  has  been  generated  by  the  measure  y, ;  if  m  ■  n,  the  total  sequence  has  been 
o  1 

OO 

generated  by  the  measure  y  .  Given  some  observation  sequence,  our  objective  is  to 

o 

00  00 

decide  if  the  y  to  y  change  has  occurred.  Since  the  very  occurence,  and  even  more 
the  time,  of  such  a  change  is  uncertain,  any  decision  test  that  we  may  develop  should 
be  clearly  sequential.  To  gain  insight  into  the  problem,  however,  we  will  initially 
assume  that  a  fixed  length  n+1  observation  sequence  w"  is  available;  where  n:  0<n£°°. 
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Let  n  be  a  nonnegative  finite  integer.  Let  w“  be  an  observation  sequence,  and 

oo  oo 

let  tbe  measures  y  ,  u  be  mutually  independent.  Given  w  ,  we  want  to  make  a  decision, 
o  1  o 

regarding  the  possibilities: 


generated  by  y 


wmfl  g®11®*®*®^ 


;  For  all  m  :  -1  <  m  <  n 


(2) 


For  fixed  n,  the  formalization  (2)  is  a  hypothesis  testing  problem,  with  n+2 

00  00 

hypotheses.  Let  us  denote  by  the  hypothesis  that  the  y^  to  y^  change  occurred 

just  after  the  mth  datum;  where  H_^  denotes  the  hypothesis  that  the  total  sequence 
n  <® 

w  has  been  generated  by  the  measure  y, ,  and  where  H  denotes  the  hypothesis  that  the 
o  in 

a  oo  oo 

total  sequence  wq  has  been  generated  by  the  measure  y^.  Let  both  the  measures  yQ 
and  y“  have  density  functions,  and  let  fQ  |  wi J j  fl(Wl|«j)  1  3- k  denote 

^  oo  ^  oo 

conditional  density  functions,  induced  by  the  measures  yQ  and  y^  respectively. 

Then,  assuning  equally  probable  hypotheses,  the  optimal  Bayesian  formalization 
results  In  the  following  test: 

Decide  in  favor  of  H  if: 


^  fi  KlVa»l)_ 
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«k+l 


[i  Klwk«) 
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o  ' 
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;  where 


(3) 


12  ^  i 

f„(“i 


f  (if  lw  1)  -  f  (w) 
o  \  ol  o  /  o  o 


flf  .flftn)  ■  (1  <’V1) 

fiKI 1-1 


i»n+l 


n+1 


i-1 


Given  the  density  functions  f,  and  f  ,  let  us  define  the  following  two 

i  o 


statistics , 


®  f  (y  y^  . 

<»”>  4  ^  (  Y>g 

f0(uik  )' 


f .  (w. Iwi-1) 
1 _ il  0 


,(w“) a 

\ fit,  v»tK 


Given  an  infinite  data  sequence  w,  the  natural  sequential  extension  of  the  test 

in  (3)  includes  the  statistic  T’(wn)  in  (4),  and  a  threshold  y,  and  it  consists  of 

n  o 

a  stopping  variable  N’ (w) ,  defined  as  follows. 


N^(w)  £  inf {n  :  T^(w")  >  y}  (6) 

Given  w,  the  sequential  test  stops  then  at  N^(w),  and  it  is  decided  that  the 


OO  09 


to  y^  change  has  occurred. 

From  the  definitions  of  the  statistics  T'(wn)  and  T  (wn)  in  (4)  and  (5),  it  is 

no  no 

dear  that  an  appropriate  5  threshold  exists,  such  that  the  stopping  variable. 


N~(w)  ■  inf{n  :  T  (wn)  >  6} 
o  no  — 


maps  the  stopping  variable  N^(w)  in  (6).  The  sequential  test  determined  by  the 
stopping  variable  (w)  in  (7)  has  recursive  properties  that  the  test  determined 
by  lT(w)  does  not  possess.  Indeed,  it  is  easily  concluded  from  (5)  that  the 
following,  recursion  holds. 


)  "  mAX  (°*  Tn(wJ  +  Sn(W«  )) 

n+l  o  n  o  n  o 


;  where 


5 


It  is  clear  from  expressions  (8)  that  the  test  determined  by  the  stopping  rule 

oo  oo 

Nj(v)  is  one-sided.  For  arbitrary  stationary  processes  yQ,  y  ,  the  recursions  in 
(8)  require  that  the  whole  data  sequence  w”  be  kept  in  memory.  If  the  processes 


CO 

l 

O 


yQ,  y^  are  £-order  Markov,  only  an  Z- size  memory  is  required.  If  the  processes  y^ 
and  y“  are  memory less,  no  memory  is  needed.  In  the  latter  case,  the  two  stopping 
variables  in  (6)  and  (7)  are  identical  (for  y  -  6),  and  either  one  is  as  in  [7],  [8], 
and  [6].  Lorden  [6]  proved  that  then,  the  sequential  test  determined  by  the  stopping 
variable  N^(w)  in  (7)  is  asymptotically  optimal;  that  is,  for  6  -*•  »,  we  then  have. 


E  {N.(w)}  >  5 
yo  6 


(9) 


E  (N6(w)}  ~  \Zog  6 1  *  I~ 


-1 


(10) 


;  where 


A 

Il'EU1{&8f(H)) 
1  o 


(11) 


and  any  stopping  rule  N(S)  that  satisfies  (9),  is  such  that  E  (n(5)}  >  E  {N-(w)J, 

M1  “  yi  6 

Qualitative  speaking,  Lorden' s  result  says  that  when  the  two  stationary  processes 

00  oo 

y  and  y.  are  also  memory less,  then,  the  stopping  rule  N,(w)  in  (7)  gives  for  6  *►  ® 
OX  0 

OO  00 

the  minimum  possible  expected  time  from  the  occurence  of  the  y  to  y^  change  (given 

that  this  change  occurred)  to  its  detection,  among  all  the  possible  stopping  rules 

that  satisfy  the  false  alarm  bound  in  (9). 

In  this  paper,  we  consider  the  case  where  the  two  processes  y°°  and  y”  are 

o  1 

stationary  with  memory,  and  ergodic.  We  then  propose  the  test  determined  by  the 
stopping  variable  N^(w)  in  (7).  We  will  prove  asymptotic  optimality  of  the  test  in 
a  precise  fashion,  for  processes  possessing  certain  properties.  We  will  derive 


bounds  on  the  performance  induced  by  one-sided  tests,  and  we  will  show  that  those 
bounds  are  attained  by  the  stopping  rule  Ng(w)  in  (7),  if  the  above  properties  are 

% 

ir  satisfied. 

*  3.  Performance  Bounds  for  One-Sided  Tests 

In  this  section,  we  will  establish  performance  bounds  on  one-sided  sequential 


7 


is  Che  union  of  Che  above,  N  is  an  extended  stopping  variable.  Also,  (IS)  holds 

since 

\tN‘+l“)  £  yN.I-r1}  -  E^{Nm)  -  E^H). 

To  prove  expression  (14),  define  as  in  [6], 

(i  ;  \  <  “ 

km\  ;  k-1 

\  0  ;  ®  00 


Since  vtQ  is  ergodic,  we  have. 


•fim  n”1  L  -  E  {£,}*P  (N  <  ")  <  a  a.e.(P  ) 
n*»  JLt  ^  si  U  1  -  Vn 


k-1 


(16) 


If  E  {N*}  ■  ",  (14)  is  trivial.  Let  E  {N*}  <  ",  let  N*=  0,  and  let  N*;  i  >.  1 

®  *■ 


be  such  t±  , 


K  <  <+1  ;  v  i 

If  -  n,  apply  N  to  wn+r,  vn+rfl,  ...,  for  all  r's, 

and  define  N*  as  the  first  time  that  stopping  occurs. 

Then,  N*  ■  N*,  and  the  variables  N  -  N*  are  identically  distributed,  for 
1  m  m-1 

different  m  values,  no  less  than  two.  Since  £jj*+r  "  1  for  some  r,  that  causes 

*  ®  r 
stopping  at  N  , ,  we  have, 
m+1 


^N*+l  +  . . .  +  >  l 

m  m+1 


+  ...  +  Cjj*  _>  m  ;  m 
m 


and 


(17) 


Thus, 


+  + 

m  .  in  .  _ 

- —  0 


N 


m 


N 


Due  to  (16),  the  left  hand  part  of  (17)  Is  bounded  from  above  by  cx,  as  m  00 . 

The  right  hand  part  of  (17)  converges  to  E  ^{N*},  as  m  •+■  °°,  by  the  ergodic  theorem. 


We  point  out  here,  that  if  the  extended  stopping  variable  N  is  determined  by 
the  statistic  ^  8n(wn);  where  g  (w^)  is  given  by  (8),  and  the  threshold  is  a  \ 
then  expression  (12)  in  theorem  2  is  satisfied  [11].  Thus,  theorem  1  applies  to  the 
stopping  variable  Ng(w)  in  (7),  if  we  put  6 -a"1.  The  stopping  variable  Nfi(w)  is 
exactly  N*  in  (13),  for  N  determined  by  the  statistic  X)  8  (w11) .  Thus,  we  have. 


n>C 


E  (N^(w)}  _>  6 


(18) 


for 


E  (N.(w)}  <  E  {n} 
yl  6  yi 


N  -  inf 


jn  :  J^giCw^)  >  6  | 

*  i=>0  ' 


(19) 


(20) 


Due  to  theorem  1,  and  the  conditions  (18),  (19),  and  (20),  satisfied  by  the 

stopping  variable  Ng(w)  in  (7),  we  will  now  focus  our  ultimate  objective  on  showing 

that  for  6  -*•  »,  and  under  certain  conditions,  E  (N.(w)}  is  the  infimum  among  all 

y.  o 

■>  ^  ^ 

the  expected  values  E  (N  };  where  N  is  any  extended  stopping  variable  that  satisfies 

yl 

the  condition  E^  (N*J  j>  6.  Thus,  we  will  show  that  the  stopping  variable  N.(w)  is 
then  optimal  in  the  above  sense.  Our  approach  will  be  as  follows.  We  will  first 
find  an  tipper  bound  bn  E^  (Ng(w)},  for  Then  we  will  show  that  this  upper 


bound  can  not  be  smaller  asymptotically  (5  -*■  °°)  than  {n*} ;  for  any  extended 

stopping  variable  satisfying  E  {N*}  _>  6,  and  under  the  appropriate  assumptions. 

^o 

Let  us  define, 

A  1  fl<Wo _1) 

L  -  n  1  *og  1  ~T  (2D 

"  f  <vn  X) 

o  o 


I1A  -  lim  L 
10  n^oo  n 


pn(v)  -  rMian  <  V) 


Let  us  now  impose  the  following  conditions. 


I1Q  exists  (I1Q  <  ®)  and  is  equal  to  {l1Q},  a*e*  (py  ^ 


For  v  e  (0,  110)»  we  have. 


tim  np  **  0 
_  n 

n-H» 


2>. 


The  conditions  (A)  are  relatively  mild,  and  they  are  satisfied  by  most  ergodic 
processes.  If  those  conditions  are  satisfied,  then  we  have  (Berk  [1]): 


£im  E  {n}  - 

Ct-H)  yl  Ey1tIlO^ 


llm  E  (N>  -  ;  for  6  ■  a  1 

yl  \U10* 


;  where  N  the  extended  stopping  variable  in  (20) , 
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We  note  that  conditions  (24)  are  also  satisfied  if  instead  of  conditions  (A), 
the  martingale  conditions  of  Chow  and  Robbins  [2],  or  the  strong  mixing  conditions 
of  Lai  {4]  hold.  From  (19)  and  (24),  we  thus  conclude  that,  for  6  00 ,  and  if  the 


conditions  (A)  hold,  the  expected  value  {Ng(w)}  does  not  exceed  \log  6| 

—1  ^  _ 

Therefore,  \tog  6|  •  E  (l^)  is  then  an  upper  bound  on  {^(w)}. 

1  1  a 

Let  C*  be  the  class  of  all  the  extended  stopping  variables  N  ,  satisfying 

o 

E  (N*>  >  5.  Let  us  define. 


\1(v- 


n*(5) 


-  inf  E  {N*} 
N*eC*  V1 


(25) 


Our  final  objective  will  be  to  prove  that  for  processes  and  that  satisfy 
conditions  (A),  we  have. 


llm  n*(6)  >  |£og  6j  •  (26) 

6-*00  l 


We  will  undertake  this  task  in  the  next  section.  In  the  remaining  of  this 
section,  we  will  present  a  useful  theorem  and  a  lemma. 


Theorem  2  (Wald  [11]) 

Let  N  be  the  sample  size  of  a  test  against  with  error  probabilities  a 
and  B  respectively.  Let  \iQ  and  be  stationary,  let  the  conditions  (A)  hold,  and 
let  I^Q  be  as  in  (22).  Then, 


E. 


!&>g 


fl(wo  > 


v-r1) 


>  (l-B)  £og  —■  +  B  log  >  (l-B)  \log  a |  -  log  2 


(27) 


Proof 

The  first  part  of  the  above  double  inequality  is  in  Wald  [11]. 
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The  lower  bound  in  the  above  inequality  is  due  to  0  log(l~ct)  being  nonnegative , 
and  due  to  the  fact  that  the  minimum  value  of  3  log  3  +  (1-3)  log  (1-3)  is  equal  to 
-log  2. 


Lenina  1 


Let  {a^}  and  {b^}  be  two  sequences,  such  that, 

00 

•ii# 


Then, 


bi  >  0  ;  *  l.  b4  >  bI+1i  »  1 


Z)  1  *i  bt  1  ‘life  ai  bJ 


(28) 


i»l 


Proof 

Define, 

OO 

cj  'ZX  bi ' 3  ^ 1 

i 

Then, 

«  00 

'i  -  «£>  t)  ■  ci  *i +  c£*i  ■  ai  bi  +Dai  bi  i 
1  2  2 

00 

-  bi^  ai  "  bi  ^  ^  ~b1  b1  <  0  - 

1 

00  00  00 

*  2*!  bi  ■  ciXX +  ci  *i  ‘ax  bi  i  ci  J2  ai 
2  2  2 


Similarly, 


ai  bi  -  Cj  S  ai  ~  C1  53  ai  J  i  ~  2 
J  j  j 


Now, 


QO  CO 


00  00 


E1  *i  bi  '£2*1  bi  ^Sci  E*i  ■  ci  E 1  *i 


k«l  k 


k-1  k 


and  the  proof  is  complete. 

4.  The  Infiaum  of  the  Stopping  Time 

In  this  section,  we  will  prove  that  for  stationary  processes  y"  and  y",  that 

satisfy  the  conditions  (A),  the  bound  in  (26)  holds.  We  will  do  that  via  a  theorem. 

In  the  proof  of  the  theorem,  theorem  2  and  lemma  1  will  be  used.  Note  that  if  the 

limit  I  in  (22)  exists,  and  the  processes  y?  and  y°°  are  also  ergodic,  then  the 
iu  l  o 

oo  00 

limit  I  n  is  the  mismatch  entropy  of  y.  with  respect  to  y  . 

10  1  o 


Theorem  3 

Let  be  the  class  of  all  extended  stopping  variables  N* ,  that  satisfy  the 

condition  E  {N*)\>  5.  Let  n*(5)  be  as  in  (25),  let  the  conditions  (A)  be  satisfied, 
*o 

and  let  be  as  in  (22).  Then, 


£im  n*(6)  ~ 
5-*» 


10 


Proof 

It  suffices  to  show  that  for  every  e  in  (0,1),  there  exists  c(e)  <  °°,  such 

A 

that  for  any  stopping  variable  N  in  Cg,  we  have. 
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By  {I10}  ‘  K  {N}  -  (l_e)  EU  {N}  "  C(£)  (29) 

H1  M1  o 


Step  1 

Given  e,  let  us  define  stopping  variables  T^;  i _>  0,  such  that. 


To  ■  0  •  Ti  *  Tl+1  <  ”  *  *  1 


^i+l  :  tke  8maHest  n  (or  «•  if  such  an  n  does 
not  exist)  such  that  <  n  and 


£1(WT1+1)  -  £  f«(w?  4.1 > 


o'T±+r 


From  Wald’s  [11)  argument,  we  have  that  P  (T.  <  •)  -  P  (decide  H  )  -  8. 

yl  1  ° 

Then  e  is  the  lowest  threshold  for  deciding  in  favor  of  H^,  which  in  Wald's  test 
is  larger  than  3(l-a)  1  >  8.  Thus,  P  0^  <  «*)  <  e.  Let  us  define. 


Drk  "  *wi  :  Tr-1  "  k*  ?  k  <  N 


(30) 


CD 

Let  us  denote  by  P^+^  the  probability  induced  by  the  measure  and  applied 

to  data  sequences  wfc+1,  wfc+2,  •••  •  Then,  provided  that  P^CD^)  >  0,  we  have 

due  to  the  arguments  above:  P,  ,,(T  <  “Id  ,  )  <  e.  On  the  set  D  ,  in  (30),  and 

k+1  r  ‘  rk  —  rk 

for  given  N  in  C^,  given  T^,  we  define  a  sequential  test  based  on  the  sequence 
Wk+1'  Wk+2*  ****  as  follows: 


Stop  at  min  (N,  T  )  • 


Decide  : 


Pk+1  s  if  N<xr 


P  ;  if  N  >  T 
^o 


The  number  of  observations  taken  for  the  above  test  is  min  (N,  T^)  -  k. 


,*  v  . 
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vfaose  conditional  expectation  pk+1HDrJt>  is  at  most  {N>.  To  this  point,  our 
dirIvationB  are  basically  as  in  the  proof  of  theorem  3  in  Lorden  [6],  with  the 
appropriate  modifications. 

Step  2 

Mow,  Wald'  inequality  (27)  in  theorem  2  holds,  on  D  .  with  a  -  P  (N  <  T  Id  ) 

rk  p  —  r1  rk 

nd  1-8  -  Pfc+1(H  <  Xr |Drk> .  So,  w.  hovo. 


f  >'  ' 

rrT1 


-  Pk+1(N  -  TrlDrk)  *  Pp  (N  -  TrlDrk)  I  -  2  1 

(31) 


>  (l-e)  |*og  Py  (N  <  TjD^)  I  -  tog  2 
o 

J  where  Hf  -  min  (N,  T^)  and  the  last  part  in  (31)  is  due  to, 

W  ^  Trl  V  i  Pk+l<Tr  *  "IV>  i  *■* 

Due  to  the  conditions  (A),  given  5  >  0,3  N  (£)  <  »: 


^ Jci-k)-1  £og  |  -  E  <1  >  I  <  £  ; 

'  fQ(»k+1)  ‘  “l  10 


V  i  >  k  +  N  (£) 
o 


(32) 


Due  to  (32),  we  now  obtain. 


o  k+1 

k+N  (O 


i  L  W"  1lDrk>  Ek+1  j  7~T~ 

1-k+l  (  VW 


N  -  i,  D 


rk  I 


+  IC  +  1  {I10)J  sup  ess  sup  T! 


i-k+N  (0+1 
o 


(i-W  Pk+1{» 


l|wj)  < 


°a>  (  f  w1 

<  I  W  •  U  +  E  {Il0}]  4-  £  Pl+1(N  -  1|  V  Ek+1  iog  „  .  t. 

1  1  i=k+l  olWk+l' 


f  -1  fl(w?  \ 

But,  due  to  cooditions  (A) ,  the  expected  values  E  )n  tog  - (  are 

.  ui(  1  f  («">  ) 

(  f  (w“)  l  o  1 

bounded  for  all  n.  If  this  bound  is  B,  then  E  \£og  — — — /  <  n  B  <  “;  for  n  <  “. 

_  x  f  (w")  ) 

/  *  /  k+nN |  )  o  1 


Therefore 


,  e  kg  vial  A 

A'  ov  k+l; 


is  bounded  a.e.  (P  ). 


Due  to  the  above  arguments,  and  since  Nq(  £)  is  finite,  we  conclude  that  there 
exists  some  finite  constant  C(£)  such  that. 


k+No(5) 


i-k+1 


.(“-ilD*)  \+i|^e  » ■  *• D* }  <  Cl° 


o'  k+r 


;  for  almost  all  w*  in  measure  y^. 


From  (33),  and  (34),  we  obtain. 


V"1  •  ‘5  +  V{Iio»i  4.  c(o 

1  v'Wi  1  1 


;  a.  e.  in  D^,  in  y^ 


-*•  i  (N>  •  E  {lin>  +  C?(C) 

5-0  H  *1  10 


Step  3 


Let  R  be  the  smallest  integer  r  _>  1  such  that  Tr  >  N;  where  {T^}  the  sequence 

in  step  1.  If  P  (R  _>  r)  >  0,  then  P  (R  <  r  +  1  |r  r)  is  well  defined  and  it 
^o  ^o 

equals  F  (N  <  T  |T  x  <  N) ,  which  is  an  average  over  k  of  the  probabilities 


%  -  ",  *v  •.  V  •_  .  -  •. 


16 


P  (N  <  T, |T^_^  ■  k  <  M),  satisfying  (31).  Therefore,  P  (R  >  r)  >  0,  and  convexity 
Mo 

of  - -tog  implies. 


Pk+l^Drk^ 


k-0 


J-  TT^IvU  (1-E)  l*»»  P„  <*  <  r+HR  > 

o(Wk+r 1  '  ° 


r)  |  -  -tog  2 


(36) 


If  P  (R  <  r+l|R  >  r)  >  Q;  r  _>  1,  such  that  P  (R  >  r)  >  0,  then  P  (R  >  r+1)  <  (1-Q) 


hence  R  {R}  <  Q  Thus ,  we  obtain  from  (36): 


k-0 


o  k+1 


E)  E  {R}  -  tog  2 
*o 


(37) 


We  observe  that  due  to  (35)  and  (37),  E  {r}  is  bounded,  that  is  E  {R}  < 

Po  Mo 

Also,  from  (35)  and  (37),  we  obtain. 


(1-e)  E  {R}  -  tog  2  <  E  {N}- [C  +  E  +  C(C)  - 

”o  **1  -LU 

♦  I  {N)*E.  {!,„}  +  C»  (Q 


£*0  M1 


U,  10' 


(38) 


Step  4 


The  sequence  {T^}  in  step  1  is  a  sequence  of  cumulative  sums  of  ergodic  and 

stationary  Integer  valued  random  variables,  under  the  probability  measure  y  .  We 

o 

have. 


*y  <tr>-E^  <R“  i}  Ey  {trIr“  i} 


0  w  ° 


"Epy  <R  "  1),Ey  {TilR  "  i}  *  1 


i-1 


(39) 
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But  the  expected  value  E  {T^|r  ■  l}  decreases  with  Increasing  i 


applies.  Thus,  we  obtain  from  (39), 


_  00  _  oo 


-  E  {R>  •  E  {T, } 

y  y  1 

O  O 


;  with  equality  if  the  processes  y^,  y^  are  memozyless,  which  is 
and  Lorden's  case. 

From  (40),  and  due  to  the  definition  of  R  in  step  3,  we  obtain, 

*08  E  00  <  log  E  {T  }  <  log  E  (rJ  +  -dog  E  {T  } 

*o  o  ”o  **o 

or 

*og  E  {R>  >  log  E  {N}  -  log  E  (Tx> 
o  %  o 

From  (38)  and  (41)  we  now  obtain  directly, 

(1-e)  log  E  (n)  -  (1-e)  log  E  {T  }  -  £og  2  < 

Po  yo  1 

<  1^00- [£  +  E^{I10}]  +  C(0 

e*  Vtf'VlM>*c,<0 

or 

\  {N}*Ey  {110}  -  (1-£>  <e°8  Ey  {N}  "  [(1-E)  En  {T1}  +  *°8  2  +  C’(i 
11  o  o 


But  E  {T^}  is  finite,  and  it  does  not  depend  on  N,  but  only  on  e 
*o 

Also  C' (£)  is  finite  and  independent  of  £.  We  can  thus  write 


and  lemma  1 

(40) 

Page's 

(41) 


>] 

(42) 

in  step  1. 


c(e)  -  (1-e)  dog  E  {T  }  +  log  2  +  C*  (O  <  » 
yo  1 

and  we  have  proven  inequality  (29). 

The  proof  of  the  theorem  is  now  complete. 


5.  Concluding;  Remarks 

Via  the  derivations  in  section  3,  and  theorem  3,  we  have  basically  proved  the 
following  theorem. 


Theorem  4 

Let  yo  and  y^  be  two  stationary,  ergodic,  and  mutually  independent  stochastic 

processes  with  memory.  Let  conditions  (A)  in  section  3  be  satisfied.  Consider  the 

extended  stopping  variables  N  ,  generated  by  testing  a  to  y^  change.  Let  Cg  be 

the  class  of  all  such  extended  stopping  variables  N*,  that  also  satisfy  the  condition 

E  {N*}  >  6.  Let  n*(5)  be  defined  as  follows, 

*o 


n*(6)  -  inf  S  {N*l 


N*eC* 


Vi 


Let  be  defined  as  follows. 


I 
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“  -dim  n  1  log 
n-*» 


Then,  due  to  conditions  (A),  exists  a.e. 
Furthermore, 


(P  )  and  is  equal  to  E  (I,). 
Vx  10 


•dim  n*(6)  ~ 
6-ko 


V1^ 


and  for  the  extended  stopping  variable  N.(w)  in  (7): 


19 


Urn  r  (N,(w) }  ~ 


*og  g 


T 


What  theorem  4  basically  says  Is  that,  the  sequential  test  described  by  the 

stopping  variable  (w)  in  (7),  is  asymptotically  (6-*00)  optimal  among  all  tests 

in  class  C*  in  the  theorem.  The  test  then  minimizes  the  expected  time  between  the 

occurence  of  a  y  to  y,  change  and  its  detection,  under  the  constraint  that  if  this 
o  l 

change  does  not  occur,  then  the  expected  time  for  a  false  alarm  (exceeding  the  upper 

threshold  6)  is  no  less  than  the  threshold  value  6. 

The  sequential  test  described  by  the  stopping  variable  N.(w)  in  (7) 

o 

operates  in  a  way  exhibited  by  expressions  (8).  The  statistic  T  (w11)  is  updated 

n  o 

sequentially  as  follows. 


W»fl> 


max 


(°-  V»” 


)  +  tog 


f„(wn+l'-o 


V\ 

»“)/ 


.  fl(“n+J“o)  »> 

The  updating  step  is  to g  - — -  ,  and  the  y  to  y.  change  is  decided 

V'Vil""’ 

the  first  time  N  that  the  statistic  Tit(wn)  exceeds  the  threshold  6. 

N  O 

We  note  that  the  conditions  (A)  for  optimality,  hold  for  a  large  class  of 

09  oq 

ergodic  and  stationary  processes.  As  an  example,  let  y  and  y  be  both  Gaussian 

1  o 

with  common  spectral  density  and  means  equal  to  0  and  0  respectively.  Let  R 

n 

denote  the  n-dimensional  covariance  matrix  induced  by  the  common  spectral  density, 
and  let  R  *  be  its  inverse.  Let  f  ^(w)  denote  the  spectral  density  induced  by 

li 


R  for  n-*».  Then,  if  f  *(w)  and  (2tt) 
n 


-x  e  £-i 

J-TT 


(<*))  sinto  [1-costd]  ^  dw  exist, 

f,  (w 


the  conditions  (A)  are  satisfied.  Furthermore,  the  updating  step  log 


i  n, 

f  (w  Iw11) 
o  n+1  o 


has  then  a  linear  from  0*p  (w  , .  -  h  (w  )].  The  constant  p  and  the  function 

n  n+1  no  n 

h  (v11)  can  be  updated  themselves  sequentially,  if  the  spectral  density  function  of 

n  o 

00  00 

the  processes  y^  and  yQ  has  convenient  form. 
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